Using linear interpolation to improve histogram equalization for speech recognition
نویسندگان
چکیده
This paper presents a novel approach to speech data normalization by introducing interpolation for histogram equalization. We study different ways of histogram interpolation that inhence this normalization technique. We found that using a special weighting factor to combine current and past test sentence statistics improved speech recognition performance. For the testing that used weighted histogram interpolation we achieved 44.85% phone error rate against 49.42% phone error rate for the testing without normalization and 48.59% phone error rate, when only a single test sentence histogram was used for normalization. Recognition experiments were conducted on speech data recorded in a moving car and proved advantage of using interpolation for data normalization by histogram equalization.
منابع مشابه
Histogram Equalization to Model Adaptation for Robust Speech Recognition
We propose a new model adaptation method based on the histogram equalization technique for providing robustness in noisy environments. The trained acoustic mean models of a speech recognizer are adapted into environmentally matched conditions by using the histogram equalization algorithm on a single utterance basis. For more robust speech recognition in the heavily noisy conditions, trained aco...
متن کاملClassification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
متن کاملRobust Spelling and Digit Recognition in the Car: Switching Models and Their Like
Performance of speech recognition systems strongly degrades in the presence of background noise, like the driving noise in the interior of a car. We aim to improve noise robustness focusing on all major levels of speech recognition: feature extraction, feature enhancement, and speech modeling. Different auditory modeling concepts, speech enhancement techniques, training strategies, and model ar...
متن کاملHistogram Equalization Based Front-end Processing for Noisy Speech Recognition
In this paper, we present Gabor features extraction based on front-end processing using histogram equalization for noisy speech recognition. The proposed features named as Histogram Equalization of Gabor Bark Spectrum features, HeqGBS features are extracted using 2-D Gabor processing followed by a histogram equalization step from spectro-temporal representation of Bark spectrum of speech signal...
متن کاملA new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering
In this paper, a new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering is proposed. The progressive histogram equalization (PHEQ) performs the histogram equalization (HEQ) progressively with respect to a reference interval which moves with the present frame to be processed. The multi-eigenvector temporal ...
متن کامل